How to Account for Idiomatic German Support Verb Constructions in Statistical Machine Translation

Authors

  • Fabienne Cap
  • Manju Nirmal
  • Marion Weller
  • Sabine Schulte im Walde
Abstract

Support-verb constructions (i.e., multiword expressions combining a semantically light verb with a predicative noun) are problematic for standard statistical machine translation (SMT) systems, because such systems cannot distinguish between literal and idiomatic uses of the verb. We work on the German-to-English translation direction, for which the identification of support-verb constructions is challenging due to the relatively free word order of German. We show that we achieve improved translation quality for verb-object support-verb constructions by marking the verbs when they occur in such constructions. Additional evaluations revealed that our systems produce more correct verb translations than a contrastive baseline system without verb markup.
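The verb markup idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the lexicon, the `_SVC` marker, the coarse POS tags, and the function name `mark_svc_verbs` are all hypothetical, and the sketch assumes that co-occurrence of a verb lemma and a noun lemma in the same sentence is enough to detect a construction, whereas a real system would likely need syntactic analysis to cope with German word order.

```python
from typing import List, Set, Tuple

# Hypothetical lexicon of (verb lemma, noun lemma) pairs; illustrative only.
SVC_LEXICON: Set[Tuple[str, str]] = {
    ("stellen", "Frage"),         # eine Frage stellen: to ask a question
    ("treffen", "Entscheidung"),  # eine Entscheidung treffen: to make a decision
}

# A token is (surface form, lemma, coarse POS tag); the tag set is an assumption.
Token = Tuple[str, str, str]


def mark_svc_verbs(sentence: List[Token], marker: str = "_SVC") -> List[str]:
    """Append `marker` to each verb whose lemma pairs with a noun lemma
    from the same sentence in SVC_LEXICON. Sentence-level co-occurrence
    is a simplifying assumption, not the method of the paper."""
    noun_lemmas = {lemma for _, lemma, pos in sentence if pos == "NOUN"}
    output = []
    for surface, lemma, pos in sentence:
        is_svc_verb = pos == "VERB" and any(
            (lemma, noun) in SVC_LEXICON for noun in noun_lemmas
        )
        output.append(surface + marker if is_svc_verb else surface)
    return output


# Idiomatic use: "Er traf gestern eine wichtige Entscheidung."
idiomatic: List[Token] = [
    ("Er", "er", "PRON"), ("traf", "treffen", "VERB"),
    ("gestern", "gestern", "ADV"), ("eine", "ein", "DET"),
    ("wichtige", "wichtig", "ADJ"), ("Entscheidung", "Entscheidung", "NOUN"),
]
# Literal use: "Er traf gestern einen Freund."
literal: List[Token] = [
    ("Er", "er", "PRON"), ("traf", "treffen", "VERB"),
    ("gestern", "gestern", "ADV"), ("einen", "ein", "DET"),
    ("Freund", "Freund", "NOUN"),
]

print(" ".join(mark_svc_verbs(idiomatic)))  # Er traf_SVC gestern eine wichtige Entscheidung
print(" ".join(mark_svc_verbs(literal)))    # Er traf gestern einen Freund
```

With markup of this kind, the marked and unmarked forms of the verb become distinct source-side tokens, so a translation model can in principle learn separate translations for the idiomatic and the literal use.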


Similar resources

Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate

This paper presents a systematic human evaluation of translations of English support verb constructions produced by a rule-based machine translation (RBMT) system (OpenLogos) and a statistical machine translation (SMT) system (Google Translate) for five languages: French, German, Italian, Portuguese and Spanish. We classify support verb constructions by means of their syntactic structure and se...


Nominalizations in German Support Verb Constructions

Angelika Storrer, TU Dortmund. Support verb constructions (SVC), such as to take a risk or to set into motion, are composed of a support verb (SV, e.g. take, set) and a predicative noun (PredN, e.g. risk, motion), which is, in most cases, a nominalization of a verb or...


Issues in Translating Verb-Particle Constructions from German to English

In this paper, we investigate difficulties in translating verb-particle constructions from German to English. We analyse the structure of German VPCs and compare them to VPCs in English. In order to find out if and to what degree the presence of VPCs causes problems for statistical machine translation systems, we collected a set of 59 verb pairs, each consisting of a German VPC and a synonymous...


Project Proposal: How Verb-Particle Constructions Affect Translation Quality in English and German

This project deals with verb-particle constructions (VPCs) in the context of statistical machine translation, with the purpose of finding out if and to what degree they affect translation quality in English and German. Its main goal is to identify which types of VPCs cause translation errors and to find possible solutions to this problem. VPCs are a type of multiword expressions (MWEs), meaning...


Modeling verbal inflection for English to German SMT

German verbal inflection is frequently wrong in standard statistical machine translation approaches. German verbs agree with subjects in person and number, and they bear information about mood and tense. For subject–verb agreement, we parse German MT output to identify subject–verb pairs and ensure that the verb agrees with the subject. We show that this approach improves subject-verb agreement...




Publication date: 2015